GLU Implementation Architectures for Heterogeneous Systems
Abstract
While a heterogeneous system offers substantial aggregate processing power to exploit parallelism in GLU programs, its processors and links exhibit fluctuating loads and can fail arbitrarily. On such a system, effective exploitation of coarse-grain parallelism requires the runtime strategy for load distribution to adapt to system dynamics. In this paper, we describe two eduction-based architectures for implementing GLU on heterogeneous systems that use different approaches to cope with dynamically fluctuating loads, processor failures, as well as processor additions. The first implementation architecture, which is based on multiple distributed generators, deals with fluctuating loads as well as processor changes using an adaptive value-demand distribution function. The second implementation architecture, which is also based on multiple distributed generators, uses dynamic inter-generator term-demand migration to adapt to fluctuating loads as well as processor changes.
Similar resources
AutoMatch: Automated Matching of Compute Kernels to Heterogeneous HPC Architectures
HPC systems contain a wide variety of heterogeneous computing resources, ranging from general-purpose CPUs to specialized accelerators. Porting sequential applications to such systems for achieving high performance requires significant software and hardware expertise as well as extensive manual analysis of both the target architectures and applications to decide the best performing architecture...
Variable Value Manager Suspended Function
eduction model. We postulate that this implementation can potentially offer superior performance scalability when compared to the single-master implementations even when the granularity of parallelism is not very coarse. Immediate future work will focus on evaluating the performance scalability of the new implementation using GLU applications with modest granularity of parallelism. In the long ...
Towards a multifrontal QR factorization for heterogeneous architectures over runtime systems
During the last decade, computer architectures for high performance computing have considerably evolved toward heterogeneous systems equipped with different types of computational units and a higher number of cores per chips. An example of popular heterogeneous architectures widely adopted in the high performance computing domain are GPU-based systems. In the work presented in this talk we stud...
Design and Implementation of a High Speed Systolic Serial Multiplier and Squarer for Long Unsigned Integer Using VHDL
A systolic serial multiplier for unsigned numbers is presented which operates without zero words inserted between successive data words, outputs the full product and has only one clock cycle latency. 
The multiplier is based on a modified serial/parallel scheme with two adjacent multiplier cells. Systolic concept is a well-known means of intensive computational task through replication of fu...
Aspectizing Multi-agent Systems: From Architecture to Implementation
Agent architectures have to cope with a number of internal properties (concerns), such as autonomy, learning, and mobility. As the agent complexity increases, these agent properties crosscut each other and the agent’s basic functionality. In addition, multi-agent systems encompass multiple agent types with heterogeneous architectures. Each of these agent types has different properties, which ne...
Publication date: 2007